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Abstract 

We discuss several possible caveats which arise with the interpretation of measurements of fluctu- 
ations in heavy ion collisions. We especially focus on the ratios of particle yields, which have been 
advocated as a possible signature of a critical point in the QCD phase diagram. We conclude that 
current experimental observables are not well-defined and are without a proper quantitative meaning. 
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I. INTRODUCTION 



The investigation of fluctuations and correlations, i.e. an event- 
energy heavy ion collision data has been ongoing for several years 



w-event analysis, of high 



cussion was refreshed 
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14| . Recently, this dis- 



2l| in the view that (event-by- event) fluctuations of multiplicity, or 



of ratios of multiplicities, might serve as an observable to pin down the critical behavior of the 
collective, and potentially at some point of its evolution thermalized, system created when two 
heavy ions collide. Thus, the measurement of ratio fluctuations might make it possible to fix the 
values of temperature and baryon chemical potential of a critical point (see e.g. 22, |23|) in the 
phase diagram of Quantum Chromo Dynamics (QCD). In this paper we discuss the problems 
and possible caveats of such an analysis. 

An event-by-event analysis ultimately deals with (joint) distributions of a certain set of 
observables. Parameters such as mean value, variance, skewness and kurtosis can be employed 
to describe the shape of a distribution obtained. Their values, as the distribution obtained itself, 
will depend on many experimental factors. More complicated measures are often lacking a clear 
interpretation. In particular we will investigate the commonly used measure of dynamical ratio 
fluctuations. 

It is not the aim of this paper to suggest an improved observable or method capable of 
locating a critical point of strongly interacting matter, or one that could potentially be indicative 
of new physics. The goal of this paper is to discuss caveats of the existing methods which are 
usually not mentioned in the literature. The manuscript is organized as follows. In Section [III 
we formulate some rather general statements about the particularities of an event-by-event 
analysis. In Section IIIII we then focus on a particular method and discuss some difficulties one 
might face with its interpretation. A Discussion of specific experimental and theoretical studies 
can be found in Section IIVI Some concluding remarks are summarized in Section |Vl 



II. GENERAL CONSIDERATIONS 

Let us start by making a rather general observation. The statistical properties of a sample of 
events depend on the rules chosen to select this sample from an even larger sample of events, and 
on the degree of completeness of the information available about each member of this sample. 
In the context of heavy ion collision physics these two aspects translate into centrality class 
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construction from minimum bias data, and an experiments capabilities of particle identification 
and momentum measurement. We will elaborate on these aspects in the following. 



A. Centrality Selection 

We first discuss centrality selection. Two heavy ions collide with relativistic momenta. Being 
extended objects, they can do so in a variety of different ways. Roughly the following rule should 
apply (in the average sense): The larger the interaction region, the more particles of each species 
are produced. However, the initial state of the collision cannot be observed directly. All that 
can be observed is the final state. From this one can then infer the likelihood of a certain initial 
state. Yet, each single possible final state observable will generally suggest a slightly different 
initial state. Hence, the need arises to average over centrality classes. The problem is then: 
Within any such a centrality class will be events with rather different initial states, altering the 
true correlation between two observables. Results will depend on, in particular, which trigger 
was chosen to construct these centrality classes, or sub-samples, of events. 
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FIG. 1. (Color online) Illustration of centrality selection being done in two different ways, once via the 
number of charged particles in the final states (-^^h^) ^^^'^ once via the number of wounded nucleons 
(Np). This leads to two different sub-samples, despite the fact that the same centrality class (20 — 22%) 



was constructed. The different colors depict the probability. The figure is taken from 



24|. 



To illustrate this point we refer to Fig. [H Shown is a result of a HSD transport simulation. 



taken from 



2J]. The two observables shown in Fig. [T] are the number of participating (or 
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wounded) nucleons Np, and the multiplicity of charged final state hadrons N^^^-^ in a mid- 



but serves to illustrate 



rapidity acceptance window. This analysis is somewhat hypothetical 
a crucial point: Depending on which one of the two observables is used to define the sample 
of 20 — 22% most central events, two rather different sub-samples, indicated by the vertical and 
horizontal lines, are created. These two samples could possibly have rather different statistical 
properties. The two methods are hence not equivalent, and both introduce their very own bias. 
It deserves to be stressed that the smaller the centrality class intervals (here 2%) are chosen 
the more distinct are the sub-samples obtained from two different trigger conditions. 

B. Acceptance in Momentum Space 

The second aspect we want to discuss is particle acceptance. By this we mean particle 
identification, momentum measurement, and ultimately the geometric coverage of the interac- 
tion region by the detector. We ignore for now centrality selection and think of a perfectly 
prepared initial state, and explore two limiting cases. The first one being the ideal detector. 
All final state particles are observed, and any correlation can be measured to any degree. The 
opposite limit would be a very bad detector, capable of only detecting a particle every once 
in a while. Such a detector could surely measure the ratios of the occurrence of particles of 
different species (provided sufficient particle identification capabilities) and hence allows for a 



comparison of model results for ratios of averaged particle multiplicities to actual data |25H27|. 
However it would be completely unable to inform us how the multiplicities of different parti- 
cle species within one event are correlated or how this correlation would change in different 
segments of momentum space. Any realistic detector is in between these limits. 

Also here Fig. [1] can serve as an illustration. Changing the acceptance window for charged 
final state hadrons N^^;^ , which was for the purpose of this analysis located around mid-rapidity 
(while assuming that Np can still be measured with the same accuracy), will change the 
marginal distribution P^N^'^J) = J2np ^^^p^ -^Ih^)- -'■^ particular mean value, variance, skew- 
ness, and kurtosis of the marginal distribution P{N^jJ) are affected. The distribution P{N^h^) 
is often used as a reference distribution for construction of centrality classes by collider exper- 
iments. That means, depending on the value N^^^-^ of a particular event, one assigns this event 
to a certain centrality class. 



^ As the number of participating nucleons is not accessible to experimental measurement. 
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III. FLUCTUATIONS OF THE RATIO OF TWO OBSERVABLES 



Above considerations served to emphasize the importance of the way by which a certain data 
set was constructed and the degree of completeness of the information available about each 
element of ones data set. These two aspects also determine whether two data sets can really be 
compared to each other. We now focus on one method, the measurement of the fluctuations of 
the ratio of particle multiplicities, while leaving it to the reader to decide whether or not some 
aspects of this discussion could be shared by other methods. 



A. Data Distribution 



Firstly a plain observation: independent of a particular analysis technique, what is considered 
here is the ratio of two observables. Each of which will have a certain value in each event. 
Directly (as done e.g. in the STAR experiment 



28j) or indirectly (as done e.g. in the NA49 



experiment 29|]) one can obtain then the ratio of these two observables. In the following we 
will focus on the numbers of particles of a certain species in each event. 

We again consider construction of centrality classes and the window of particle acceptance 
available. In general one can conclude that the joint (2 dimensional) distribution Pdata{Ni, N2) 
of the multiplicities of the two species '1' and '2' of particles will be determined by the choices of 
solutions to the two aforementioned aspects, i.e. acceptance and centrality selection. The dis- 
tribution of ratios, Pdata{Ni / N2) , being constructed from the joint distribution, is then certainly 
no exception. 

A measure for quantifying the width of the distribution of ratios Pdata{Ni/N2), similar to 



the ones used by the STAR 28|] and NA49 collaboration 29|], could read: 



Here {{ANiY) denotes the variance of observable iVj, {Ni) denotes the mean value of the 
observable N^, and (AA'iAA^2) denotes the co- variance of the observables A''i and N2. 

The above discussion about centrality selection and particle acceptance certainly does not 
do justice to all the technical details which will ultimately determine the shape of the joint 
distribution Pdata{Ni, N2) of the two observables A'^i and N2. Yet, all five terms, which make 
up the measure depend strongly on many of these aspects. Hence, there is no reason to believe 



that the measure cx^, given in Eq. [H would not depend on them. 



B. Mixed Event Background 

The need arises then to remove the bias introduced. The mixed event background subtrac- 
tion should remove all "trivial" correlations due to acceptance and centrality selection, while 
leaving all "physical" correlation unchanged. Note, that different mixing procedures have been 



suggested [30|], but since they do not change the argumentation of our analysis we do not com- 
ment on them separately. Before focusing on the mixed event background, let us make another 
observations about the measure cx^. This measure, being so general, could be applied to any 
sample, as long as the observables A'^i, N2 are positive integers. Independent of what A'^i and N2 
may represent. This implies that also the event mixing procedure, once it is specified, could be 
applied to a more general (or plainly differenlo) system with observables A'^i and N2. 

Let us consider a particular event mixing procedure similar to the one used by the NA49 
collaboration. Firstly a reference distribution Pdata{N) of the observable N = Ni + N2 is 
constructed: 

TV 

PdataiN) = Pdata{N,,N-Ni) . (2) 

Afi=0 

A mixed distribution Pmix{Ni,N2) can be obtained by sampling the reference distribu- 
tion Pdata{N) and assigning the values for A'^i and N2 according to a binomial distribution. 
The mixed event background distribution is then: 



Pmix{N^.N2) = q'''{l-qr' jPdata{Nr + N2) , (3) 

where q = (Ni) / {(Ni) + is the probability that a randomly drawn particle is of species '1'. 
The distributions PdataiN) and Pmix{N) are then identical. 

Stated differently, once the distribution Pdatai.Ni, N2) is chosen (or fixed), then also the 
new mixed background PmixiNi, N2) is similarly determined. This is to say, for any generic 
distribution P{Ni,N2) a mixed background and the measure of dynamical ratio fiuctuations 

'^dyn ~ '^data ~ '^mix i'^) 



^ An example, unrelated to physics, could come to mind: The number of kindergartens and supermarkets 
in every city and town could be evaluated. Groups of cities sorted according to some criterion, like their 
estimated number of citizens, would form centrality classes. Our ability to accurately determine the number 
of kindergartens and supermarkets would correspond to particle acceptance. This might not be a particular 
thoughtful example, yet it shows that such an ana|^sis is not restricted to heavy ion physics. And one still 



are equally determined. This is independent of what A'^i and N2 represent, and independent of 
how the original data set came about. 



C. Illustrations 

We are considering the example of a Bi-variate Normal Distribution (BND), with five pa- 
rameters: marginal variances ((AA^i)^)^^^, and {{AN2)'^)data, their co- variance {ANiAN2)data, 
and two mean values, {Ni)data, and {N2)data- Having specified these values, and having decided 
which event-mixing procedure should be applied, the measures, (j'^^.tay ^mix^ ^lym be 
straightforwardly obtained. This implies the existence of a four- dimensional hyper-surface in 
this parameter space representing BNDs with an equal amount a^^^ of dynamical ratio fluctu- 
ations. 

Let us explore the limit of a BND, Fig. [2] {left), with completely correlated observables A^i 
and A^2- The correlation coefficient 

{AN,AN2) 

P = I (5) 

between them is then Pdatai^i, N2) = 1. The ratio of A'^i and here in this example is then 
always equal to unity. The same will not be true for the mixed event background, Fig.^{right), 
where the correlation will be weaker, Pmix{Ni, N2) < 1. Following above event mixing proce- 
dure, one then quickly finds cr'^ata — 0' '^mix > 0' ^^'^ ^"dyn < 0- words, despite the fact that 
the ratio is always one, the measure, discussed in Eqs. [1] and HI suggests a degree of negative 
dynamical ratio fiuctuations for this distribution. 

Also other limiting cases could be explored. Here we mention the class of BNDs with 
co-variance (AA'^iAA^2) = 0, as illustrated in Fig. [21 The distribution Pdatai^i, N2) then fac- 
torizes into the product of two Normal distributions Pdata{Ni, N2) = Pdata{Ni)Pdata{N2). For 
the example shown in Fig. [31 a BND with mean values {Ni)data = {N2)data = 50, and co- 
variance {ANiAN2)data = was chosen. In Fig. [3] the resulting co- variance of the mixed event 
background {ANiAN2)mix (left) and the measurqj of dynamical ratio fiuctuations adyn (right) 
are shown in their dependence on the variances {{ANi)'^)data and {{AN2Y) data- In general one 
finds {ANiAN2)mix 7^ after event mixing, i.e. the mixed event background now contains 
correlations, which the original sample did not. 



Following the convention we present this number as adyn = sign(CT^ ) \ \a'\yj\ ■ 100 %. 
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FIG. 2. (Color online) Illustration of a perfectly correlated data distribution (left). The mixed event 
background (right) shows a wider distribution. The measured dynamical ratio fluctuations associated 
with the original distribution are then negative. 




FIG. 3. (Color online) Illustration of uncorrelated ({ANiAN2)data = 0) data distribution leading 
to a correlated {{ANiAN2)mix 7^ 0) mixed background (left) and resulting dynamic ratio fluctua- 
tions adyn (right). 

IV. DISCUSSION 

Having formulated rather general statements about the particularities of an event-by-event 
analysis of heavy ion collision data, and having discussed a specific fluctuation measure, we feel 
the need to add a few more specific comments. 



8 



Measurements of fluctuations and correlations of particle multiplicities are often mentioned 
with regard to the detection of a critical point in the QCD phase diagram or other predictions 
for the properties of yet to be observed new states of matter. Certain aspects of distributions are 
supposed to shed light on their (temporary) formation. Given the caveats discussed throughout 
this paper the authors are less optimistic. 

The effects discussed above are hard to quantify as this would have to be done specifically 
and systematically. The biases introduced by a particular centrality selection criterion and a 
given particle acceptance would have to be studied separately for each observable. Focus should 
be put on the different solutions realized in the two main groups of experimental set-ups, fixed 
target and collider geometries. This was not the aim of t his p aper. Yet some examples can be 



quoted. Here we want to refer to a recent review paper 17|, summarizing the PhD thesis of 



V. Konchakovski. In particular we mention Sections 6 and 8. 



The first example we want to quote is the correlation coefficient 3l|, l32] between the multi- 
plicities of charged hadrons in two narrow symmetrically around mid-rapidity arranged accep- 
tance intervals, usually termed 'forward-backward correlations'. These correlations have been 



studied in their dependence on choice of centrality trigger and size of centrality classes 



17 



24 



33] . A wider definition of centrality classes leads to a stronger correlation between forward and 
backward hemispheres. A larger system will have lots of particles in both acceptance windows. 
A smaller system will obviously have less particles. Combining systems of different sizes into 
one sample will lead to (strong) correlations in this sample from geometric considerations alone. 
A different choice of the centrality trigger will results in a different centrality dependence of 
ones observable. No unambigous (final state) measure is available to select only events of the 
'same (initial) system size' into one sample. 

For the second example we want to return to the fiuctuations of hadron multiplicity ratios. 
An HSD simulation of heavy ion collisions has been performed for several center of mass ener- 
gies. At low center of mass energy the data was analyzed twice. Once for the NA49 acceptance 
for 3.5% most central collisions selected via the projectile spectator nucleon signal, and once for 
the STAR acceptance for 5% most central collisions selected via the reference charged hadron 
multiplicity distribution. From above considerations differences between these two set-ups are 
to be expected. The situation is summarized as follows : For some ratios of the multiplicities 
of selected hadrons stronger differences between the two samples emerge than for others. Given 
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our difficulty to interpret the measure of dynamical ratio fluctuations, we flnd it hard to make 
a positive conclusion about the apparent numerical similarity of the results of the NA49 34 1 



and STAR 



35| collaborations. 



It is equally hard to disentangle different contributions to ones fluctuation and correlation 
signals. The fact remains that correlation must not be confused with causation. We cordially 
acknowledge the commonly quoted motivation for performing statistical analysis of heavy ion 
collision data. However the (conjectured) effects to be investigated, or hoped to be discovered 
and established, are not the only factors determining the statistical properties of ones data 
sample. 

The view that a (sharp) non-monotonic behavior of a suitably chosen observable evaluated 
on a suitably constructed and modifled data set can straightforwardly be taken as evidence for 
new physics (e.g. the detection of a critical point) is not held by the authors. 



V. SUMMARY 

This discussion largely fell into two parts, each of which tried to raise two points. Firstly the 
results of any event-by-event analysis do depend strongly on how one constructs sub-samples 
from minimum bias data. Minimum bias data from different experimental set-ups can vary 
quite strongly, and no initial state can be observed. Secondly, and equally important, is an 
experiments capability to record accurate and detailed information about each event. The 
patchier the data set the less detailed and conclusive any analysis of it, or comparison to 
another data set, can be. 

We then discussed a particular procedure. Also here we tried to raise two points. Firstly a 
lot of information is discarded in favor of a single number, cr^y^- This number might not be very 
indicative of a systems properties. Secondly, plainly constructing a reference distribution from 
a given data distribution according to some set of rules does not necessarily help to subtract 
correlations. Unfortunately no reference (data) sample (where a certain effect is simply not 
contained) is available in high energy heavy ion collision physics. 

Given above considerations we flnd it hard to compare and interpret results from different 
experiments. Especially the dependence on the center of mass energy of the colliding ions might 
then carry little information. 
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